About Me

My current research lies at the intersection of generative modeling, multimodal AI, and visual reasoning, with a particular focus on diffusion-based models and their applications in computer vision.

I completed my Bachelorโ€™s degree in Electrical and Computer Engineering at National Yang Ming Chiao Tung University, where I am now continuing my journey as a Ph.D. candidate.

๐Ÿ“ Publications

  • Score Replacement with Bounded Deviation for Rare Prompt Generation
    ๐Ÿชถ Bo-Kai Ruan, Zi-Xiang Ni, Bo-Lun Huang, Teng-Fang Hsiao, Hong-Han Shuai
    ๐ŸŒ Preprint 2025
    One-liner: Adaptive switching from frequent to rare prompt for rare concept generation
    [Paper]
  • Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback
    ๐Ÿชถ Yi-Lun Wu, Bo-Kai Ruan, Chiang Tseng, Hong-Han Shuai
    ๐ŸŒ In Neural Information Processing Systems (NeurIPS) 2025
    One-liner: Preference optimization requires with preferred data only
  • Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation
    ๐Ÿชถ Sung-Lin Tsai, Bo-Lun Huang, Yu Ting Shen, Cheng Yu Yeo, Chiang Tseng, Bo-Kai Ruan, Wen-Sheng Lien, Hong-Han Shuai
    ๐ŸŒ In ACM International Conference on Multimedia (ACM MM) 2025
    One-liner: Color embeddings can be represented by text embeddings
    [Paper]
  • TF-TI2I: Training-Free Text-and-Image-to-Image Generation via Multi-Modal Implicit-Context Learning in Text-to-Image Models
    ๐Ÿชถ Teng-Fang Hsiao, Bo-Kai Ruan, Yi-Lun Wu, Tzu-Ling Lin, Hong-Han Shuai
    ๐ŸŒ In International Conference on Computer Vision (ICCV) 2025
    One-liner: Image features are text features within MM-DiT
    [Paper] [Code] [Project Page]
  • MAD: Makeup All-in-One with Cross-Domain Diffusion Model
    ๐Ÿชถ Bo-Kai Ruan, Hong-Han Shuai
    ๐ŸŒ In Conference on Computer Vision and Pattern Recognition Workshop (CVPRW) 2025
    One-liner: All makeup tasks are domain transfer tasks
    [Paper] [Code] [Project Page]
  • Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References
    ๐Ÿชถ Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai
    ๐ŸŒ In AAAI Conference on Artificial Intelligence (AAAI) 2025
    One-liner: Painterly harmonization is to add contents but preserve style
    [Paper] [Code]
  • Modeling Uncertainty for Low-Resolution Facial Expression Recognition
    ๐Ÿชถ Ling Lo, Bo-Kai Ruan, Hong-Han Shuai, Wen-Huang Cheng
    ๐ŸŒ IEEE Transactions on Affective Computing (T-AC) 2023
    One-liner: Caring low-resolution facial expression recognition with uncertainty
    [Paper]
  • Mimicking the Annotation Process for Recognizing the Micro Expressions
    ๐Ÿชถ Bo-Kai Ruan, Ling Lo, Hong-Han Shuai, Wen-Huang Cheng
    ๐ŸŒ In ACM International Conference on Multimedia (ACM MM) 2022
    One-liner: Training the model to recognize expressions like how we do
    [Paper] [Code]

More can be found in my Google Scholar Page.

๐ŸŒŸ Services

Conference Reviewers

  • Neural Information Processing Systems (NeurIPS), 2025
  • International Conference on Learning Representations (ICLR), 2024-2025
  • AAAI Conference on Artificial Intelligence (AAAI), 2025
  • ACM International Conference on Multimedia (ACM MM), 2023-2025
  • IEEE International Conference on Multimedia (ICME), 2023-2024

Journal Reviewers

  • IEEE Transactions on Multimedia (T-MM)

๐ŸŽ“ Educations

  • 2023 - Present, ECE PhD, National Yang Ming Chiao Tung University, Taiwan
  • 2019 - 2023, ECE Undergraduate, National Yang Ming Chiao Tung University, Taiwan